INTRODUCTION 


WHAT IS PARIS? 


CONNECTION MACHINE "MACROINSTRUCTION SET" 
EQUIVALENT TO SERIAL MACHINES ASSEMBLY LANGUAGE. 
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VIRTUAL MACHINE ARCHITECTURE 


SCALABLE ARCHITECTURE 


VIRTUAL PROCESSORS, ONE PHYSICAL PROCESSOR SIMULATES A NUMBER 
OF VIRTUAL PROCESSORS 
TWO MECHANISMS FOR INTERPROCESSOR COMMUNICATION 
- ROUTER -- ANY PROCESSSOR CAN SEND DATA TO ANY OTHER PROESSOR 
- NEWS GRID -- TWO DIMENSIONAL GRID. DATA CAN BE SENT ON TWO 
DIMENSIONS: NORTH, EAST, WEST AND SOUTH. 
RICH VIRTUAL INSTRUCTION SET 
- ARITHMETIC ON INTEGER 
- ARITHMETIC ON FLOATING POINT NUMBERS (FAST) 
- INTERPROCESSOR COMMUNICATIONS 
- COOPERATING COMPUTATIONS 


- PER-PROCESSOR STACK 
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VIRTUAL PROCESSOR ORGANIZATION 


ONE PHYSICAL PROCESSOR SIMULATES MANY VIRTUAL PROCESSORS 
EACH PHYSICAL PROCESSOR SIMULATES A SMALL TWO-DIMENSIONAL GRID. 


ONE KIND OF PROCESSOR ADDRESSING IS VIRTUAL CUBE ADDRESS, WHERE 
EACH VIRTUAL PROCESSOR HAS ITS ADDRESS LAID OUT IN THE FOLLOWING 


FASHION: 
PHYSICAL PROCESSOR 


VIRTUAL CUBE ADDRESS 


VY AND VX REPRESENT THE X AND Y COORDINATES OF THE PROCESSOR ON 
THE GRID. ADDRESSING STARTS AT UPPER LEFT WITH Y INCREASING 
TOWARD SOUTH AND X INCREASING TOWARD EAST. 


ANOTHER KIND OF PROCESSOR ADDRESSING IS VIRTUAL MEMORY ADDRESS 
WHERE EACH VIRTUAL PROCESSOR HAS ITS ADDRESS LAID OUT IN THE 
FOLLOWING FASHION:: 


MEMORY ADDRESS 
PHYSICAL PROCESSOR WITHIN PROCESSOR 


VIRTUAL MEMORY ADDRESS 


Digitized by the Internet Archive 
in 2023 with funding from 
Kahle/Austin Foundation 


https://archive.org/details/parislectureOOunse 


Sorting 
The Enumerate Pack Sort 


( defun simple-sort ( field length ) 
;, create temporary variables 
( let (( total-space ( + cm:*cube-address-length* length)) 
( addresses ) 
( send-buf )) 


;; allocate space on the stack for addresses 

( setq addresses ( cm:push-space total-space )) 

;; allocate space on the stack for send-buf 

( setq send-buf ( + addresses cm:*cube-address-length* )) 


;; start the sort loop, this will sort the whole field for length 
( loop for bit from field below ( + field length ) 
do 


;; enumerate processors with zero in bit 
aap | cm:lognot-always cm:context-flag bit 1 ) 
( cm:enumerate addresses cm:*cube-address-length”* ) 


;; count processors with zero in bit 
(let ((num-zero-procs (cm:global-count cm:context-flag))) 


;; enumerate processors with one in bit 
(cm:move-always cm:context-flag bit 1 ) 
(cm:enumerate addresses cm:*cube-address-length*) 


;; create addresses of processors that will exhange data 

;; add number of processors with zero in bit to 

;; final addresses of processors with one in bit 

(cm:u+constant addresses num-zero-procs 
cm:*cube-address-length* ) 


;; select all processors 

;; swap the fields of all selected processors 

( cm:move-constant-always cm:context-flag 11 ) 
(cm:send send-buf addresses field length ) 
(cm:move field send-buf length ))) 


;; reset stack pointer 
(cm:pop-and-discard total-space ))) 


Sorting 
The Enumerate Pack Sort 


The algorithm takes a key in each processor and sorts them. For 
each bit of the key it executes the following steps: 


processor number = Ont? .3 45.6 7-] 
value (key) = homo 14272} 
first bit of key = Dye 100s 0:] 
First Step 

processor number = POR 2 S405 O75} 
selected processors with Oinfirstbit= [xxxx11x1] 
enumerate creates addresses = LX cxexak ek 


processor number = [O01 

selected processors with 1 infirst bit= [111 
enumerating creates addresses = POle2s kx 4x7 
add how many 0 bit processors = [345 


before send 
processor number = [01 
3 


addresses = i 3 
data = [5 
after send 

processor number = [ 
data = [ 
Second Step 


data = [Az l2e7 37 


Third Step 


data = 12284577) 
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Sorting 
The Enumerate Pack Sort 


33; Goal: Take a key in each processor and sort them. 


;;; Assumptions: All processors are selected. 


535 
33; Algorithm for parallel sort: 
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;;; Loop for i from starting-location below 
(+ starting-location length ) 


. Select processors with /th bit being zero. 


Enumerate these processors, this will create a pointer 
to unselected processors. 


. Count the number of processors with /th bit being zero. 


. Select processors with /th bit being one. 


Enumerate those processors. 


. Add to the enumerated address of processors with /th 


bit being one a count of number of processors with ith 
bit being zero. Now processors with /th bit being one 
point to processors with ith bit being zero. And 
processors with /th bit being zero point to processors 
with /th bit being one. 


. Select all processors. 


Now send data from processors with /th bit being zero 
to processors with /th bit being one. And at the same 
time, send data from processors with ith bit being one 
to processors with /th bit being zero. 
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Address Transformation Instructions 


CM:my -cube-address 
-x-address 
-y-address 

Arguments: 

destination 


CM:x-from-cube 
CM:y-from-cube 


Arguments: 
destination source 


CM:front-end-x-from-cube 
CM:front-end-y-from-cube 


Arguments: 
cube-address 


CM:cube-from-x-y 


Arguments: 
destination x-source y-source 


CM:front-end-cube-from-x-y 


Arguments: 
x-address y-address 
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INSTRUCTION TYPES 


o ARITHMETIC 

o LOGICALS 

o COMPARES 

© CONVERSION 

© © FEST 

0 COMMUNICATION 

© COOPERATIVE COMPUTATION 


o DATA MOVEMENT 
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-Signed Integers 
CM:+ 
constant 
carry 
flags 
Arguments: 


destination source length 
destination source-constant length 


CM:add 


Arguments: 
dest sourcel source2 dlen slenI slen2 


-Unsigned Integers 
CM:u+ 
constant 


carry 
flags 


Arguments: 
destination source length 
destination source-constant length 


CM:unsigned-add 


Arguments: 
dest sourcel source2 dlen slen1 slen2 


-Floating-Point Numbers 
CM:f+ 


Arguments: 
destination source signif-len expt-len 
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-Signed Integers 
CM:- 
constant 
borrow 
Arguments: 


destination source length 
destination source-constant length 


CM:subtract 


Arguments: 
dest sourcel source2 dlen slenI slen2 


-Unsigned Integers 
CM:u- 
constant 
borrow 


Arguments: 
destination source length 
destination source-constant length 


CM:unsigned-subtract 


Arguments: 
dest sourcel source2 dlen slenl slen2 


-Floating-Point Numbers 
CM:f- 


Arguments: 
destination source signif-len expt-len 


Multiplication 


-Signed Integers 
CM:* 


Arguments: 
destination source length 


CM:multiply 


Arguments: 
dest sourcel source2 dlen slenI slen2 


-Unsigned Integers 
CM:u* 


Arguments: 
destination source length 


CM:unsigned-multiply 


Arguments: 
dest sourcel source2 dlen slen1 slen2 


-Floating-Point Numbers 
CM:f* 


Arguments: 
destination source signif-len expt-len 
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Division 


-Signed Integers 
CM:floor-divide 
CM: ceiling-divide 
CM:truncate-divide 
CM:round-divide 


Arguments: 
dest sourcel source2 dlen slen! slen2 


CM:mod 
CM:rem 


Arguments: 
destination source length 


CM: floor-and-mod 

CM: ceiling-and-remainder 
CM:truncate-and-rem 
CM:round-and-remainder 


Arguments: 
destl dest2 sourcel source2 dlen slenI slen2 
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ADDRESSING MODES 


- ABSOLUTE 
ABSOLUTE ADDRESS IS USED TO ACCESS 
ANY LOCATION IN MEMORY. 


“| *user-memory-address-limit* 


a 
f 


virtual 
STACK processor 0 
STACK ADDRESSING IS USED TO ACCESS DATA 


ON THE STACK. ONE GLOBAL STACK POINTER. 


CM:*user-memory-address-limit* 


| stack 


virtual 


stack-pointer processor 0 
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-Unsigned Integers 
CM:unsigned-floor-divide 
CM:unsigned-ceiling-divide 
CM:unsigned-truncate-divide 
CM:unsigned-round-divide 


Arguments: 
dest sourcel source2 dlen slenl slen2 


CM:unsigned-mod 
CM:unsigned-rem 


Arguments: 
destination source length 


CM:unsigned-floor-and-mod 
CM:unsigned-ceiling-and-remainder 
CM:unsigned-truncate-and-rem 
CM:unsigned-round-and-remainder 


Arguments: 
destl dest2 sourcel source2 dlen slenI slen2 


-Floating-Point Numbers 
CM:f/ 


Arguments: 
destination source signif-len expt-len 
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-General Communication using router 


CM:send 
CM:send-with 
-overwrite 
-logior 
-logand 
-logxor 
-add 
-max 
-min 
-unsigned-max 
-unsigned-min 
CM: get 


Arguments: 
destination cube-address source lengthedimenlinistn 


Uses Virtual Processor address. 
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neral Communication usin 


CM:store 
CM:store-with -overwrite 
-logior 
-logand 
-logxor 
-add 
-max 
-min 
-unsigned-max 
-unsigned-min 
CM:fetch 


Arguments: 
memory-address source length time-limit 


Uses Virtual Memory address. 
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FLOATING POINT NUMBERS 


CM:*user-memory-address-limit* | 


virtual 
processor 0 


FLOATING POINT NUMBER - IS SPECIFIED BY THREE PARAMETERS: 
a - BIT ADDRESS a 
s - SIGNIFICANT LENGTH 


e - EXPONENT LENGTH 
TOTAL LENGTH OF BITS IN REPRESENTATION IS s +e + 1. 


MANY PRECISIONS ARE SUPPORTED 

e CAN BE >2 UP TO CM:*MAXIMUM-EXPONENT LENGTH 

s CAN BE >1 UP TO CM:*MAXIMUM-SIGNIFICANT LENGTH 
FOR JEEE SINGLE, s = 23 AND e=8 

IEEE DOUBLE, s = 52 AND e=11 


ts 


*tinsit-eeoibbha-¢ Om omi+ 198! 
L+aere8 


a ~~. 7 


soaurta? # 


y° 4obe9907q 


QS ABA SRT 76 CSP A. MMU TAT TA : : 
¢ S2BROOA TH +e 7 

WT OVE THADRIMON, 8» 

TOLL THON =p 

i os» 20 MGITATVIELAR Mi ZT IO 


CCRC HAA | 

ary etree ena NO a 9 
ra ioseaintniademetagaptnesnis! inet . a 
san MA Baas | 

ites HA Steud 


i c 


EMORY L_ PROCE R 


ADDRESSABLE TO INDIVIDUAL BITS 


WITHIN EACH VIRTUAL PROCESSOR THE BIT ADDRESSES RUN FROM 
0 (INCLUSIVE) TO SOME LIMIT n (EXCLUSIVE). 


MEMORY CAN BE USED FOR ANYTHING 


STACK IN EVERY VIRTUAL PROCESSOR, BUT ONLY ONE GLOBAL STACK 
POINTER. STACK GROWS FROM TOP OF MEMORY DOWNWARD. 


FOUR FLAGS IN EACH VIRTUAL PROCESSOR: 


CONTEXT-FLAG -- INDICATES WHICH PROCESSOR IS ACTIVE 
THE OPERATION IS EXECUTED ONLY IN THOSE PROCESSORS WHOSE 
CONTEXT-FLAG IS 1. ALMOST ALL PARIS OPERATIONS ARE "CONDITION. 


OVERFLOW-FLAG -- INDICATES WHICH OPERATION PRODUCED A 
RESULT TOO LARGE TO FIT INTO DESTINATION.. 


CARRY-FLAG -- HOLDS THE CARRY-IN AND CARRY-OUT FOR AN 
ARITHMETIC OPERATION. 


TEST-FLAG -- HOLDS THE RESULT OF NUMERIC COMPARISONS 
AND OTHER TESTS. 
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DATA TYPES 


BIT: A SINGLE BIT QUANTITY 

BITFIELD: A GROUP OF n CONTIGUOUS BITS 
SIGNED INTEGERS 

UNSIGNED INTEGERS 


FLOATING-POINT NUMBERS, IEEE FORMAT 
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BIT FIELD 


CM:*user-memory-address-limit* : 


=——— 


memory a virtual 
processor 0 


BIT FIELD - POINTED TO BY a AND OCCUPIES 
ADDRESSES FROM a TO a+ Nn - I(INCLUSIVE) 
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SIGNED INTEGER 


— memory-address-limit* 


aS « 
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processor 0 


SIGNED INTEGER - 2'S COMPLEMENT FORMAT 
SIGN INTEGER OF LENGTH n CAN TAKE VALUES 
IN RANGE - (2'""'} THROUGH 2"! -1. THE SIGN 
BIT IS THE MOST SIGNIFICANT BIT. 


sADATAL Ovo 


° *sim))-erethba-¢3emom- ten 


“SIN 1G 


ZHU AY THAT ADO RIELS 
yoia SUT 4 easement 


TALE TWGRAEL POD @& - wa a 
meen 


UNSIGNED INTEGER 


CM:*user-memory-address-limit* : 


=== 


memory a virtual 
processor 0 


UNSIGNED INTEGER - SAME AS A BIT FIELD. 
VALUES CAN BE IN THE RANGE OF 0 THROUGH 2 -1. 
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-Communication through the News Gri 


CM:get-from -north 
-east 
-west 
-south 
-north-always 
-east-always 
-west-always 
-south-always 


Arguments: 
destination source length 


Uses X and Y addresses. 
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